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Abstract 

We prove the existence of a limit distribution for the normalized normality measure 
M{Efq) / ^/N (as N —¥ oo) for random binary sequences En, by this means confirming a 
conjecture of Alon, Kohayakawa, Mauduit, Moreira and Rodl. The key point of the proof 
is to approximate the distribution of the normality measure by the exiting probabilities 
of a multidimensional Wiener process from a certain polytope. 

1 Introduction and statement of results 

In a series of papers starting in 1997, Mauduit and Sarkozy [11] introduced and studied 
several measures of pseudorandomness for finite binary sequences. In the present paper 
we will mainly be concerned with the normality measure A/"(-Ejv); however, for comparison 
the connection between our new results in the present paper and earlier results for other 
pseudorandomness measures will be described in Section [2] below. Let a finite binary sequence 
E N = (ei, . . . , e N ) € {-1, 1}^ be given. For k e N, M G N and X e {-1, l} fc , set 

T(E N , M, X) = #{n : 0<n< M, and (e n+1 , e n+k ) = X} . 

Thus T(En, M, X) counts the number of occurrences of the pattern X among the first M + k 
elements of En- The normality measure A/* (En) is defined as 



N(En) = max max max 

k X M 



M 

T(E N ,M,X)-^ 



(1) 

where the maxima are taken over k < log 2 N, X e {-1, l} fe , 1 < M < N + 1 - k. 



Alon, Kohayakawa, Mauduit, Moreira and Rodl studied the minimal [2] and typical [3j values 
of AT (En). Concerning the minimal possible value of Af(Ew), they proved 

1 - e ) logo N < min N(E N ) < 3N 1/3 (logN) 2/3 (2) 

2 J E N e{-i,i}N 
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for sufficiently large N. There is a relatively large gap between the lower and upper bound 
in ([5]), but apparently no further progress has been made since the publication of [2] in 2006. 
Thus the questions asking for the minimal possible order of M(En) remains unsolved. For 
constructions of sequences having small normality measure, see also [T3J . 
Concerning the typical value of M(En), Alon et al. showed (improving earlier results of 
Cassaigne, Mauduit and Sarkozy [6]) that choosing En randomly from {—1, 1}^, for any 
e > there exist 61,62 > (depending on e) such that 



5WN < M(E N ) < 62VN (3) 

with probability at least 1 — e for sufficiently large N. The lower bound is optimal, since for 
any 6 > 

liminf P (M{E N ) < 6Vn) > 0. (4) 

TV— >-oo V / 



Summarized, the normality measure of a random binary sequence is typically of order v N. 
At the end of [3J, Alon et al. pose the problem of investigating the existence of a limit 
distribution of 



'N 

for random En as N — > 00, and write that it is "most likely" that such a limit distribution 
exists. The purpose of the present paper is to confirm their conjecture, and show that a limit 
distribution of Q in fact exists. 

Theorem 1. The exists a probability distribution function F such that for random En the 
distribution of M{En) / 'V~N converges to F as N — > 00. More precisely, for En having 
uniform distribution in { — 1, 1}^ for N > 1, we have for any i 6 IR that 

lim P ( < t ) = F{t) . 

The function F(t) is continuous for all t G K. 



2 Comparison with other pseudorandomness measures 

Two other measures of pseudorandomness, introduced by Mauduit and Sarkozy, are the well- 
distribution measure W{En) and the correlation measure C^En)- For MeN, a £ Z and 
b £ N set 

U(E N , M, a, b) = ^ {ea+jb ■ 1 < j < M, 1 < a + jb < N for all j}. 

The well-distribution measure W(En) is then defined as 

W(E N ) := max {\U(E N ,M, a, b)\ , where 1 < a + b and a + Mb < N} , 

hence measuring the maximal discrepancy of En along an arithmetic progression. This well- 
distribution measure can be seen as a special case of a combinatorial discrepancy measure 
(cf. |10|, Chapter 4]), but can also be directly modified into a generalization of the concept 
of discrepancy in the context of uniform distribution modulo one in analytic number the- 
ory (see [3]). 
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For k G N, M G N and D = (di, . . . , d k ) G N fe with < dt < ■ ■ ■ < d k < N we define 
V(E N ,M,D) = ^2{e n+dl ...e n+dk : 1 < n < M, n + d k < N} . 



Thus V(En, M, D) measures the correlation among k segments of En, which are relatively 
positioned according to D. The correlation measure of order k, which is denoted by Ck(E^), 
is defined as 



Note that contrary to the normality measure N{En) and the well-distribution measure 
W(En), which only depend on the sequence En, the correlation measure C^En) depends 
on an additional parameter k. 

In [3] several results concerning the typical asymptotic order of the well-distribution measure 
and the correlation measure are proved. For random En, for any e > and sufficiently large 
N the correlation measure satisfies 



with probability at least 1 — e, for all 2 < k < N/4. Furthermore, for any function k(N) < 
log N — log log N with probability at least 1 — e we have 



for sufficiently large N. In other words, the correlation measure is concentrated around its 
mean if k is "small"; consequently, the limit distribution is in this case the Dirac measure 
centered at E(Cfe). For recent results on the correlation measure, particularly concerning its 
dependence on the parameter k, see [H [3 [9]. 

For the well-distribution measure, in [3] the following results are proved: for random En, for 
any e > there exist 5\, 82 > such that 



with probability at least 1 — e, for TV" sufficiently large. The lower bound is optimal, since for 
any 5 > 



note that these two results for the well-distribution measure are similar to those for the normal- 
ity measure in (|3|) and For other recent results concerning the well-distribution measure, 
see e.g. |12l 1141 [T6] . The existence of a limit distribution of the normalized well-distribution 
measure Wn(En)/\^N of a random sequence was conjectured in [3], and confirmed in [1]. 
In [T] I wrote 

"The case of the normality measure M(En) seems to be much more difficult" 



C k (E N ) = max{\V(E N ,M,D)\ : M,D satisfy M + d k < N} . 




5!Vn <W(E n ) <s 2 Vn 
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and this is really the case. The problem is mainly caused by the fact that patterns can overlap, 
and the independence of the occurrences of the single digits (a digit meaning here one of the 
numbers —1 and 1) is not sufficient to deduce independence of the occurrences of (possibly 
overlapping) patterns of multiple digits; for example, the pattern 11 appears in the block of 
digits 111 twice, and if in a block of three random digits e\e2e^ the first two digits are 11 
there is an increased probability that the pattern 11 will also appear in the last two digits 
(namely, the probability is 1/2, while it should be 1/4 in the independent case). 
We overcome the problem by cutting the index set {1, . . . , N} into blocks of length d (for large 
d) and initially considering only the appearances of patterns entirely contained within one of 
these blocks of digits. The occurrence of a certain block B (out of 2 d possible block) can be 
interpreted as the action of a random walk on a 2 d -dimensional lattice (moving one step into 
the direction associated with this specific block); the probabilities of the normality measure 
exceeding a certain value are then asymptotically equal to the probabilities of a corresponding 
Wiener process (the limit process of the normalized random walk) leaving a certain polytope. 
Furthermore, we use decorrelation methods to show that the impact of the occurrences of 
patterns not entirely contained within a block of length d is small (for d sufficiently large) . 

3 Auxiliary results 

Lemma 1 (Maximal Bernstein inequality; see e.g. Lemma 2.2]). For a sequence £i, . . . , 
of independent, identically distributed (i.i.d.) random variables having mean zero and variance 
a 2 , and satisfying |£ n | < 1, we have for t > 



The following lemma is a special case of the multidimensional version of Donsker's theorem; 
see e.g. [151 Theorem 4.3.5]. 

Lemma 2. Let (£ n ) n >i a sequence of bounded, i.i.d., d-dimensional random vectors having 
expectation zero and covariance matrix S. For s G [0,1], set 



where Z is a d-dimensional Wiener process with mean zero and covariance matrix E, and 
=> denotes weak convergence in the d-dimensional Skorokhod space D{[0, l] d ). Z satisfies the 
equation Z = BC , where B is a d-dimensional standard Wiener process (standard Brownian 
motion) and C is a d x d matrix for which C T C = £ holds. 

Remark: The use of deep probabilistic results such as Donsker's theorem could be avoided 
by replacing the continuous parameter s £ [0, 1] by taking s from a discrete, equidistant 
grid {sq, . . . ,sjt}, which corresponds to approximating the normality measure by a variant 
for which only the values sqN, . . . , srN are allowed for M. Then we could prove Theorem Q] 
using the classical multidimensional central limit theorem for i.i.d. random vectors, and 
choosing R "large" . However, the proof is much shorter and clearer using Donsker's theorem. 





\Ns] 



Then 



Y N ^Z, 
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4 Idea of the proof and preliminaries 



For the rest of this paper, we assume that (e n ) n >i are i.i.d. random variables, each taking 
a value from {—1, 1} equiprobably. We write En = (ei, . . . , ejv)- We will split the proof of 
Theorem [T] into several parts. The main idea of the proof is to show that the value of the 
normality measure N is with large probability almost equal to a restricted normality measure 
J\fW considering only those occurrences of an pattern X for which the index set is entirely 
contained in a set of the form {md + 1, . . . , (m + l)d} for some m. For each m the block 
(e m d+i> . . . , e( m+1 )d) is a random element of {—1, and of course for different values of m 
these blocks are independent. 

To each possible block B G {—1, of d digits we will assign a vector of the standard (Carte- 
sian) base of M? d , and interpret the occurrence of a specific block B as a step forward of a 
random walk in the direction of the assigned coordinate (subtracting the expected value, that 
is the average of all possible movements, in each step). By Donsker's theorem this random 
walk (appropriately normalized) converges weakly to a Wiener process. 

The probability of this restricted normality measure exceeding a certain value then converges 
to the exiting probability of the limiting Wiener process from a certain polytope. This may 
sound surprising, but in fact is quite natural if one reflects on when the normality measure 
exceeds a certain value: the restricted normality measure M^ d \E^) is larger than t if for 
some possible pattern X the sum (over B G {— 1,1} ) of the number of occurrences of B 
multiplied with the number of occurrences of X within B is greater than t. 
This sum of products can be written as a scalar product of the position of the random walk, 
multiplied with a "weight" vector (counting the number of occurrences of X within B for each 
possible B), and the event of this scalar product exceeding a certain value t equals the event 
of the random walk exiting the space between two hyperplanes (depending on t). Taking the 
maximum over all possible values of X in the definition of the normality measure corresponds 
to taking the exiting probabilities of the random walk from the intersection of all possible 
hyperplanes associated to some X (which in our case in fact produces a proper polytope). 

Let d > 1 be given, and for m > set 

A m = {md + 1, . . . , (m + l)d}. 

Modifying the definition of Af(E]y) in ([1]), we define a restricted normality measure J\f^ (En) 
by setting 

T {d \E N ,M,X) 

= #{n: < n < M, (n mod d) G {0, . . . , d — A;}, and (e n +i, . . . , e n +p.) = X} 



and 



AA '(Em) = max max max 

k X M 



T^(E N ,M,X)- M + *-^' 1 - h + [ 



(6) 



where the maxima are taken over k < d, X G {-1, l} fc , 1 < M < N + 1 - k and M + 
k — 1 = mod d. That means that for J\f( d \E]\[) we consider only patterns X of length at 
most d, and only those occurrences (e n+ i, . . . , e n+ k) of such patterns for which the index set 
{n + 1, . . . , n + k} is entirely contained in a set A m for some m > (that means, the index set 
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{n + 1, . . . , n + k} does not overstep any integer multiple of d). The additional assumption 
M + k — 1 = mod d means that we only consider sets of indices {1, . . . , M + k} which end at 
an integer multiple of d, and can therefore be written as a union of sets A m ; considering only 
values M of this form accounts for an additional possible error of at most d in comparison 
with a normality measure defined without this additional restriction. 

Note that the number of possible index sets {n+1, . . . , n+k} satisfying n < M for M+k—1 = 
mod d, which do not overstep any integer multiple of d, is precisely (M + k — l)(d — k + l)/d 
(which accounts for the very last term in line ([6])). In contrast, the number of such index sets 
which do overstep an integer multiple of d is ( — l) (k — 1) (which accounts for the 

very last term in line ([7]) below). Naturally, 



(M + k- fc + 1) 

d 



+ 



M + k-1 
d 



1 )(k - 1) = M. 



Furthermore, we set 

f id \E N ,M,X) 

= < n < M, (n mod d) E {d — k + 1, . . . , d — 1}, and (e n +i, . . . , e n +fc) = -X"} . 

Thus f( d \E N ,M,X) counts the number of occurrences of a pattern X for which the index 
set is not entirely contained in A m for some m; clearly this means that 

T(E N ,M,X) =T^{E N ,M,X) +f {d \E N ,M,X). 



Consequently, N{E^) is bounded above by 

max M w (E N ) + d + max max max T' '{Em.M, X) 

{ k<d X M 



M + k-1 \ k 



2 k 



max 



max max 

M 



M 

T(E N ,M,X)--£ 



(7) 
(8) 



where in each line the maxima are taken over X E {—1, l} k and 1 < M < N + 1 — k, respec- 
tively, and in the last maximum in line ([7]) we have the additional restriction M + k — 1 = 
mod d. Here, to compare with M, the additional term d in line ([JJ) comes from the 

restriction M + k — 1 = mod d in the definition of Af^ d '(Ejsf), and the last term in line ([TJ 
adds the contribution of the occurrences of a pattern for which the index set is not entirely 
contained in A m for some m. The term in line ([8]) accounts for the possible contribution 
of patterns which consist of more than d digits. As we will see the contribution of the last 
term in (|7|) and of the term in (JSj) is with large probability very small, provided d is chosen 
sufficiently large. 

On the other hand, N{En) is bounded below by 



N^ d \E N ) 



max max max 

k<d X M 



TW(E N ,M,X) 



M + k-1 1 \ k - I 



d 



(9) 



where the second and the third maximum are taken over X £ {-1, l} k and 1 < M < N+l-k, 
respectively. 
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Let B denote the set {—1, l} d of all blocks of d digits, and let X denote the set of all possible 
patterns of at most d digits, that is all patterns X G Ufc<d{ — 1) Furthermore, for any 
B G B and X G X, let denote the number of occurrences of the pattern X within the 

block B, that is 

w x ,B = T(B,d-k + l,X). 

In the sequel, we will use the symbol "•" for the scalar product of two vectors, and for the 
product of a scalar and a vector. We will generally write vectors in bold font. 

We enumerate the 2 d elements of B by b\, . . . , b 2 d, and let (3 1 , . . . , (3 2 d denote the Cartesian 
base vectors of M 2 . Furthermore, writing 1 for the d-dimensional vector (1, . . . , 1), we set 

/ 2 d 

X m = I !&„ (e m d+i , • • • , e( m+ i)d) • f3 u 

\u=l 

Here t bu {e md+1 , e (m+1)(i ) is the indicator function of b u , meaning l bu (e md+1 , . . . , e (m+1)d ) = 
1 if (e md+ i, . . . , e (m+1 ) d ) = 6 n , and l& u (e m(i+ i, . . . , e (m+1 ) d ) = otherwise. Then X m is a ran- 
dom vector for m > 0, and X m = (3 U — 2~ d ■ 1 if and only if (e m d+i, . . . , e( m+1 ^) = The 
random vectors X m , m > 1, are independent, identically distributed, and have expectation 
zero. 

For any X G we set wx = (^x,f>i , • • • , wx,b u ) G I£ 2< \ and for t > define a polytope as 

J*) = p {y G M 2d : |w x -y|<t}. 

Note that p(*) is defined as the intersection of finitely many half-spaces; however, it is in fact 
a proper polytope, since for all I we have bp G X , and, since Wb it b u = 1 if and only if £ = u, 

u=l 

Consequently the absolute value of the ^-th coordinate of any element y G P^> is bounded by 
t, for £ G {1, . . . , 2 d }. In other words, P^ is contained in the 2 rf -dimensional cube [— t,t] 2 . 
Furthermore, is nonempty if t > 0. 



5 Main lemmas 

Lemma [3] below is the key ingredient in the proof, showing that the probability of the nor- 
malized restricted normality measure J\fW (En) / y/N exceeding the value t converges to the 
exiting probabilities of an appropriate Wiener process from the polytope P^ . The following 
Lemmas H] and \5\ state that the error made by approximating the normality measure J\f by 
the restricted normality measure is "small" , provided d is "large" . 
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Lemma 3. Let E be the covariance matrix of the random variables X m in (jlOp . Then for 
any tGl, 

( NV\E N ) . \ _ fZ(s) 



lim 



< t 



€ P W for all s £ [0, 1]^ , 



where Z is a d- dimensional Wiener process with mean zero and covariance matrix S 
Lemma 4. For any d>A and N > No(d) 



max max max 

k<d Xe{-l,l} fc l<M<7V+l-fc, 

M+fc-l=0 mod d 



T^ d) {E N: M,X) 



M + k-l ^ k - 1 



2 fe 



Lemma 5. For any d > 4 and iV > iVo(d) 



P max max max 

\d<k<log 2 N Xe{-l,l} k l<AI<N+l-k 



T(E N ,M,X) 



Vd 



M 



d 2 -1 



16ViV \ 1 

- ~ir < d2d' 



Proof of Lemma 0' The main part of the proof of Lemma [3] is to identify the probabilit 



P ( ^lJSMmI > ^ as the exiting probabilities of the random walk 



ICS 



N 



se [0, l], 



from the polytope PW. The convergence is then an immediate consequence of Donsker's 
theorem in Lemma [2j 

By the restriction on the values of M in the definition of AfW(Epf), we can assume that N 
is an integer multiple of d, which means that N = Rd for some R. Now 



Af^(E N ) > ty/N 



(11) 



if and only if there exists an pattern X £ X of k < d digits and anM<N + l — k satisfying 
M + k — 1 = mod d such that 



V jv, , ; 2fc ^ 
which happens if and only if 



> tVN, 



((M+k-l)/d 
^ 1&„ ( e (m-l)d+l ) • • • > e md) ^X,^ 
m=l 



1 (M + fc- l)(d- k + 1) 



2 fc 



d 



(M+fe-l)/d 



^— im— 1 



X m +2- d -l)-w x 



> i\ZjV. (12) 
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Some simple combinatorics shows that 



2 d 



J>x,6„ = wi • 1 = 2 d - k (d -k + 1), 



u=l 



and thus (|12j) is equivalent to 



(s-^(M+k-l)/d v \ 



> t, 



which furthermore is equivalent to 



max 

Ks<R 



Thus (|TT|) is equivalent to 



m =l_ m pW for 



> t. 



1 P 

some s G ^'•••'^ 



and consequently also equivalent to 

> \sN/d] 



By Donsker's theorem this implies that 



lim ] 



^ m=1 m pi*) for some s G [0, 1], 
v/iV 



(Ptv) > tVN^j = P (^^^ P w for some s G [0, 1]^ 
which proves the lemma. □ 



Proof of Lemma ^ We can again assume that N is divisible by d. Clearly, 



max max max 

k<d Xe{-l,l} fe M 



T(E N ,M,X) 



M + k-1 \ /. - 1 



fc<d xe{-i,i} k 



max 

M 



T(E N ,M,X) 



M + k-l 
d 



2 k 
k-1 



> 



2 k 



> 



Vd 

Gy/N^logd" 
~Td 



(13) 



where in both lines the last maximum is taken over 1 < M < N+l — k, M+k — 1 = mod d. 

Let k < d and X G {—1, be fixed. We introduce the notation 

f( d \E N ,M,X) = #{n- < n< M, n = r mod d, and (e n+ i, . . . , e n+fe ) = X} . (14) 
Then 

d-l 

T^(E N ,M,X)= TW(E N ,M,X), 



r= d-k+i 



and consequently 



max 

l<M<7V+l-fc, 
v M+fc-l=0 mod d 



T(E N ,M,X) 



1 (M + k— l)(k - 1) 



2k 



< 



d-l 

El" I max 
\ l<M<N+l-k, 
r=d-k+l \ M+k _i= mod d 



TW{E N ,M,X) 



d 



M + k-l 



> 



Vd 



2 k d 



> 



GVNyflogd 
kyd 



Note that for M satisfying M + fc — 1 = mod d we have 

(M+k-l) /d 

TW (E'jv, M, X) = ^ lx (e( m _i)d+ r+ i , • • • , e( m _!) d+r+fc ) . 

rri=l 

By the independence of (e n ) n >i the random variables tx (emd+r+l> ■ ■ ■ > e md+r+k) are a l so 
independent for 1 < m < N/d (they are constructed in such a way that the indices do not 
overlap for different values of m). The random variables tx {e-md+r+l, • • • , emd+r+k) — 2~ fc 
have mean zero and variance 2" fc (l - 2~ k ) < 2~ k . Using LemmaUfor t = 3v ^ 2 ~ fc/ ^ Fv/I °^ , 
we obtain 



max 

l<jW"<iV+l-fc, 
\M+k-l=0 mod d 



< 2 exp 



f^(E N ,M,X) 
9k2~ k Nd~ 1 logd 



M + k-l 



2 k d 



> 



3Vk~2- k / 2 VN^fhid 
Vd 



2Nd~ 1 2~ k + 6Vk2~ k / 2 VNy/logd dr x l 2 /3 



< 2cT 3fc 

for sufficiently large N. Note that for k > 1, 



3^2"^ < 

k 



and consequently 



max 

l<M<7V+l-fc, 
v M+fe-l=0 mod d 



Thus, using (jl3|) and the assumption that c? > 4, this implies 



max max max 
k<d Xe{-l,l} fe l<M<7V+l-fc, 

M+fc-l=0 mod d 



T(E N ,M,X) 



M + k-l \ !; 



2 k 



> 



GVN^gd 
Vd 



d-l 



E E 2<f 

fc<d Xe{-l,l} fc r=d-k 

<2 ^ 2 k kd~ 3k 



3/,- 
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<j2d- 

k<d 
1 

< 



■2A- 



□ 



~d 2 -l' 

Proof of Lemma E|r Similar to (|14p we define 

f( k \E N , M, X) = # {n- < n< M, n = r mod k, and (e n+1 , e n+k ) = X} 
and note that 

fc-i 

T(E N , M,X) = Y J fW(E Nt M, X) 

Then, as in (|13p . we have 



r=0 



max max max 

<2<fc<log 2 7V Xe{-l,l} fe l<M<N+l-k 



M 

T(E N ,M,X)- ¥ 



> 



< 



< 



E E 

rf<fc<io g2 iv xe{-i,i}* 



max 

l<M<AT+l-fe 



T(E N ,M,X) 



M 
2k 



> 



16VN 
~d~ 

16\/iV N 



d 



fc-i 



y y VP max 

z — ' ^— ' ^— ' \ KM<N+l-k 
d<Klog 2 N xg{-i,i}fcr=0 \~ ~ 



M 



f^(E N ,M,X)-— k 



Note that 



max 

!<M<iV+l-fc 



fW(E N ,M,X) 



M 



< — r + max 

2 fe l<Af<JV+l-Jfe, 
(M-r)/fceZ 



fW(E N ,M,X) 



M 



Again the functions f^ k) (E Nl M,X) are constructed in such a way that they are a sum of 
independent random variables, namely 

[(M-r)/k\ 

fW(E N ,M,X)= ]T 

^-X ( e (m-l)A;+r+l) ■ • • i e mk+r) ■ 

m=l 

Furthermore, the random variables tx (e m ^ +r+ i, . . . , e m (fc +1 ) +r ) — 2~ k have variance at most 
2~ k . Thus, using Lemma [U we obtain for any r and for t = 3 • 2~ k l 2 \f~N Vlog 3, 



P I max 

l<Af<JV+l-fc, 
(M-r)/k£Z 



< 2 exp 



f r ^(E N ,M,X)-^ 
9 • 2- fe A^logd 



> 3 • 2" fc / 2 y]V 0oid 



2Nk- 1 2- k + 6 • 2- fe / 2 v / AVlogr3 rf-1/2/3 



< 2d- 3k 

for sufficiently large N. Note that for > d > 4 we have 



16 
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and consequently 



P max 
\ KM<N+l-k 



M 



T^{E N ,M,X)-— k 



dk I 



Together with (|15j) this implies 



□ 



< 



( 



max max max 

\<J<fc<log 2 iV Xe{-l,l} fc l<M<N+l-k 

* E E XX 3fc 

d<fc<log 2 7V Xe{-l,l} fe r=0 



T(E N ,M,X) 



M 
2F 



> 



16VN 



d 



< 2 E d ~ 

k=d+l 
1 



2k 



d 2d- 



6 Proof of Theorem [T] 

To prove the existence of the limit distribution, we have to show the following: for any given 
e > and tGl there exists a number iVo = Nq (e) such that for N\ , N2 > iVo we have 

(16) 



N(En 2 ) < ty/N 2 ) < e 



By ([3]) the limit distribution exists for t = 0, and satisfies F(0) = 0. Thus we can assume in 
the sequel that t > 0. Let 5 > be fixed (and "small"), and choose d sufficiently large such 
that all the inequalities Qy/\ogd/\fd < 8/3, l/(d 2 - 1) < e/6, 16/d < <5/3 and l/d 2d < e/6 
hold. By (J7J) and (0) we have 

(17) 



M + k-l \ /,• - 1 



F[Af(E Nl ) < WNx 
< f(M (d \E Nl )<(t + 6)^N^ 

fW(E Nl ,M,X) 



+ P ( max max max 

\ fc<d X M 



d 



2 k 



> 8/3^ N r 



+ 



( 



max max max 

\d<k<log 2 NX M 



<e/6 by Lemma [1] 

M 

T(E Nl ,M,X)-^ 



> 8/3^/N^j 



<s/6 by Lemma [5] 

for sufficiently large N%. Similarly, by 

(M d HE N2 )<(t-6)y/N?) 



> 



12 



max max max 

k<d x M 



T«\E N2 ,M,X)-l M + k - 1 -l^ k - 1 



d 



2 k 



> 5JN 2 



<e/6 by Lemma [4] 

for sufficiently large N%. By Lemma [3] 

P (M {d) {E Nl ) <{t + <J)v^Vi) - P G p(t+S) f o r all a E [0, 1]^ 



and 



(a/" (c0 (£tv 2 ) < (t - *)v^) - IP E P<*-*> for all a E [0, 1]^ 



< e/6 

< e/6 



for sufficiently large iVi and iV~2, and since the exiting probabilities 



P (^^S E P {u) for all s G [0, 1]^ 



depend on the parameter u continuously this implies 



Z{s) 
Vd 



G for all s G [0, 1]^ - P (^=L G P (t ~ 5) for all s G [0, 1]^ < e/6 



if (5 is sufficiently small. Overall, we have established (|16|) . which proves the existence of the 
limit distribution of the normalized normality measure. 

Now we turn the continuity of the limit distribution function Fit). Let e > be given. We 
have to show that 

F(t + S) - F(t) < e (18) 

for some sufficiently small 5 = (5(e). The continuity of F(t) at t = follows from ([3]). Thus 
we can henceforth assume that t > 0. We have 



F{t + 8)- F{t) 

lim f(N(E n )/Vn < t + 5) - lim P [M{E N )/VN < t 



N^oo 



Arguing as in the lines following (|17p we can show that 

P (N(E n )/Vn < t + 5) < P G P (i+25) for all s G [0, 1]^ + e/3 



and 



(aT(E n )/VN < tj > P G P<*-*) for all a G [0, 1]^ - e/3 



for sufficiently large iV. Together with the fact that 



Z{s) 
Vd 



G P (t+25) for all s G [0, 1]^ - P (^l G P (t " 5) for all s G [0, 1]^ 



< e/3 



for sufficiently small 5, this establishes ()18p . Altogether, we have proved Theorem [TJ 
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